PATENT APPLICATION 
SERVICE PROCESSOR-BASED SYSTEM DISCOVERY AND CONFIGURATION 

FIELD OF THE DISCLOSURE 

The disclosures made herein relate generally to data processing systems and more 
particularly to service processor-based system discovery and configuration. 

5 BACKGROUND 

Information and the means to exchange information via computing technology have 
grown to be sophisticated and complex compared to the state of the art a mere 15 years ago. 
Today, data processing systems (e.g., servers) have become critical to the efficient function 
and conduct of business in numerous sectors worldwide, ranging from governments to 
10 corporations and small businesses. The increasingly critical role of computing assets has, in 
turn, been the basis for concern from various sectors as to the reliability and manageability of 
computing assets. 

System downtime events associated with system component (e.g., hardware, firmware 
and software) problems result in considerable expense to businesses in the retail and 
15 securities industries, among others. Moreover, with networked applications taking on more 
essential business roles daily, the cost of system downtime will continue to grow. 

Another significant cost of system downtime is related to diagnosing and repairing a 
system component problem. Many systems offer only minimal diagnostic functions, and 
these generally only to the level of whether or not the system is running. System firmware 

20 (e.g., Basic Input output System (BOIS), Extensible Firmware Interface (EFQ, etc), which 
does discovery at boot time, and embedded diagnostic codes such as power-on self test 
(POST) are examples of conventional approaches for performing discovery and configuration 
tasks in a data processing system. Such conventional approaches for performing discovery 
and configuration of a data processing system carry out limited diagnostic tests automatically 

25 when a computer is powered up. 
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It is typical for a POST series of diagnostic tests performed by a particular data 
processing system to vary, depending on the BIOS configuration. But, POST typically tests 
only system components such as RAM (Random Access Memory), physical I/O devices and, 
and access to disk drives. If the tests are successful, POST initiates loading of the operating 
5 system and the system boots. Otherwise, the fault area is reported/isolated for analysis. 
However, because POST executes its diagnostic functions only upon power-up, it is not 
capable of diagnostic monitoring during normal system operations. 

Systems that are dependent upon BIOS or EFI discovery cannot track error 
conditions. Accordingly, they cannot modify boot configurations in response to transient or 
historical errors. If a particular system component passes a boot-time test at discovery, it is 
assumed to be good for use in the system. However, because the firmware (e.g., BIOS or 
EFI) is not aware of the history of that particular component (e.g., if the component had 
contributed to the failure of the OS on a previous boot), actions such as issuing a warning 
notification and/or removing a failing, failed or questionable component cannot be 
implemented prior to boot-time. 

Furthermore, systems dependent upon BIOS or EFT discovery do not track the status 
of redundant system components (e.g., a redundant path to a particular input-output device) 
over time. This precludes redundant system components from being used unless the primary 
component fails the boot-time test and precludes the system from issuing notification as to 
20 the availability of such redundant system components. Another limitation is that systems 
dependent upon BIOS or EFT discovery perform only the same tests on primary and 
redundant components. Thus, a redundant component that is showing only transient 
problems could be placed in the system if it survived the boot-time test. 

With respect to configuring a data processing system, a system dependent upon 
25 firmware-based discovery approaches facilitates discovery of system components via the 
firmware during boot-time (e.g., through hardware detection, Advanced Configuration and 
Power Interface (ACPI) tables, etc). The firmware then configures the system based on this 
information. A limitation to such an approach is that discovery information is not based on 
the historical information of the system components, but instead on component availability or 
30 customer configuration. 
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Therefore, methods and equipment adapted for facilitating discover and/or 
configuration of a data processing system in a manner that overcomes limitations associated 
with conventional approaches for facilitating discover and/or configuration would be useful. 
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BRIEF DESCRIPTION OF THE DRAWING FIGURES 

FIG. 1 depicts a data processing system in accordance with an embodiment of the 
disclosures made herein. 

5 FIG. 2 depicts a method in accordance with an embodiment of the disclosures made 

herein, wherein the method is configured for facilitating discovery of system configuration 
information and facilitating configuration of a platform operating system using at least a 
portion of discovered system configuration information. 

FIG. 3 depicts an embodiment of the operation for facilitating management of primary 
10 system components depicted in FIG. 2. 

FIG. 4 depicts an embodiment of the operation for facilitating management of 
redundant system components depicted in FIG. 2. 
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DETAILED DESCRIPTION OF THE DRAWING FIGURES 

The disclosures made herein relate to service processor-based system discovery and 
configuration. Methods and equipment in accordance with embodiments of the disclosures 
made herein are adapted for facilitating such service processor-based discovery and 
5 configuration. Managing system components in a data processing system includes 
facilitating discovery and configuration. 

The following definitions are not intended to be limiting, but are provided to aid the 
reader in properly interpreting the detailed description of the present invention. It will be 
appreciated that the terms defined herein may be eventually interpreted by a judge or jury, 

10 and that the exact meaning of the defined terms will evolve over time. The phrase "device 
drivers," as used herein and sometimes referred to as service modules, refers to images that 
provide service to other modules in memory. A driver can "expose a public interface," that 
is, make available languages and/or codes that applications use to communicate with each 
other and with hardware. Examples of exposed interfaces include an ASPI (application 

15 specific program interface), a private interface, e.g., a vendor's flash utility, or a test module 
protocol for the diagnostic platform to utilize. The word "platform" as used herein generally 
refers to the server functionality provided by the underlying hardware. Such functionality 
may be provided using single integrated circuits, for example, various information processing 
units such as central processing units used in various information handling systems. 

20 Alternatively, a platform may refer to a collection of integrated circuits on a printed circuit 
board, a stand-alone information handling system, or other similar devices providing the 
necessary functionality. The term platform also describes the type of hardware standard 
around which a computer system is developed. In its broadest sense, the term platform 
encompasses processors, and other integrated circuits that provide initialization, diagnostic, 

25 and server functionality. The word "server" as used herein refers to the entire product 
embodied by the present disclosure, typically a service processor (SP) and one or more 
processors. In an embodiment, the one or more processors are AMD K8 / Opteron 
processors, or other processors with performance characteristics meeting or exceeding that of 
AMD K8 / Opteron processors. 
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One embodiment of a data processing system as disclosed herein is a server that 
includes components for providing server operating system functionality on a platform side 
of the data processing system (i.e., a platform-side operating system) and components for 
providing service processor functionality on a service side of the data processing system (i.e., 
5 a service processor). The service processor provides functionality such as remote 
management, diagnostics, discovery and/or monitoring support of the platform-side operating 
system portion of the data processing system. 

A platform of such a disclosed data processing system (e.g., server platform) includes 
firmware (i.e., platform firmware) that is passed status information of one or more 

10 components of the platform- side operating system (i.e., system components). The status 
information is discovered and analyzed by the service processor prior to boot of the platform- 
side operating system. As discussed below in greater detail, the service processor is 
advantageously adapted for enabling the service processor to track the history of such one or 
more components and to analyze the remains of a previous operating system instantiation. In 

15 response to such tracking and analyzing, the service processor may make a determination of 
whether a component should be included as a primary component in the next boot of the 
platform-side operating system. Accordingly, the service processor can interact with the user 
prior to the boot to warn about failing components or can automatically remove suspect 
components (based on customer preference) and the system may be configured at least 

20 partially dependent upon the status information passed to the platform firmware. 

An embodiment of a data processing system as disclosed herein offers a number of 
advantages over conventional data processing systems. One advantage is allowing 
complicated configuration information to be specified by a user in a rich service processor- 
based user interface. Another advantage is allowing for error detection and tracking with 
25 persistent information, which permits configuration changes to be implement that are 
intended to repair or relieve historical and transient error conditions. Yet another advantage 
is allowing for monitoring of redundant components and notifying a system administrator or 
other authorized party of a particular component's ability to respond to error conditions. 

Turning now to discussion of specific drawings, a data processing system 100 in 
30 accordance with an embodiment of the disclosures made herein is depicted in FIG. 1. The 
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data processing system 100 includes a service processor 105, a platform 110, discovery 
components 115 and a system management user interface 120. The service processor 105 
includes a system configuration application 125 running thereon. In other embodiments of 
the data processing system 100 (not shown), the discover components 115 and the system 
5 management user interface 120 may be components of the service processor 105. The system 
configuration application 125 is configured for enabling functionality such as system 
component discovery, system component analysis and platform operating system 
configuration to be carried out. 

The platform 110 includes platform firmware 130 and a plurality of system components 135. 

10 Examples of the system components 135 include processors, memory DIMMs, coherency 
controllers (e.g., Horuses, which is a custom memory bus controller), Hyper-Transport paths 
and the like. The platform firmware 130 includes boot-time firmware 136 and run -time 
firmware 137. The system components 135 and the run-time firmware 137 are elements of a 
platform-side operating system. The boot-time firmware 136 is external to the platform-side 

15 operating system. 

Basic Input Output System (BIOS) and Extensible Firmware Interface (EFI) firmware 
environments are examples of the boot-time firmware 136. Advanced Configuration and 
Power Interface (ACPI) firmware environment is an example of the run-time firmware 137. 
In a preferred embodiment, the data processing system 100 is based on an Intel Architecture 
20 Environment. A data processing system based on the Intel Architecture Environment 
comprises semiconductor devices based on Intel Architectures (e.g., IA32, IA64, x86-64, etc) 
and makes use of at least one of BIOS, ACPI and EFI firmware environments. 

The service processor 105 relies on the discovery components 115 (e.g., low-level 
device drivers) to determine the presence and functionality of the operating system 
25 components 135. In this manner, the service processor 105 is capable of tracking status 
information of the operating system components 135. Such status information is an example 
of system configuration information (i.e., information influencing the system configuration), 
of which at least a portion is used during configuration of the platform-side operating system. 
Additional system configuration information may be received by the system configuration 
30 application 125 from the system management user interface 120 (e.g., as entered by a system 
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administrator). It is contemplated that in other embodiments (not shown) of the data 
processing system 100, the system management user interface 120 may be external to the 
data processing system 100 (e.g., a remote data processing system). The platform firmware 
130 is coupled to the service processor 105, thereby enabling system configuration 
5 information to be provided from the service processor 105 to the platform firmware 130. 

A method 200 in accordance with an embodiment of the disclosures made herein is 
depicted in FIG. 2. The method 200 is configured for facilitating discovery of system 
configuration information via a service processor of a data processing system and for 
facilitating configuration of a platform-side operating system of the data processing system 
10 using at least a portion of discovered system configuration information. The system 100 
disclosed above and other systems in accordance with embodiments of the disclosures made 
herein are examples of systems capable of implementing the method 200. 

An operation 205 is performed by the service processor for receiving administrator 
specified configuration information. The operation 205 may be initiated by a system 
15 administrator and./or by the service processor. For example, the system administrator may 
chose to modify a particular portion of the operating system configuration without being 
solicited to do so by the service processor or the system administrator may be solicited by the 
service processor to enter information required by the service processor. 

An operation 210 is performed by the service processor for facilitating management 
20 of primary system components. A primary system component is defined herein to be the 
system component that is in use during a present platform-side operating system instantiation. 
Facilitating management of the primary system components enables status information 
associated with the primary system components to be tracked. So that the platform-side 
operating system can be beneficially configured, it is advantageous to know whether each 
25 primary system component is operating properly during a particular platform-side operating 
system instantiation and that each primary system component is available for providing 
corresponding functionality in a subsequent platform-side operating system instantiation 
(e.g., upon re-boot of the platform-side operating system). 
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An operation 215 is performed by the service processor for facilitating management 
of redundant system components. A redundant system component is defined herein to be a 
system component that is not needed during a given platform-side operating system 
instantiation. Facilitating management of the redundant system components enables status 
5 information associated with the redundant system components to be tracked. So that the 
platform-side operating system can be beneficially configured, it is advantageous to know 
whether each redundant system component is available for providing corresponding 
functionality if a corresponding primary system component fails. 

A specific example of a redundant component is a path to an I/O device in a Hyper- 
10 Transport path. The Hyper-Transport protocol allows the data processing system to be 
configured in a manner that specifies the path to particular memory and/or I/O devices. A 
data processing system with multiple paths that can access a given I/O device, or region of 
memory, can maintain some of those paths as redundant paths that can be configured to 
replace other paths if they should fail. 

15 In response to receiving administrator-specified configuration information, facilitating 

management of primary system components and/or facilitating management of redundant 
system components, an operation 220 is performed by the service processor for enabling 
access of system configuration information (e.g., in its entirety or a portion thereof) by the 
platform firmware. After performing an operation 225 for accessing at least a portion of the 

20 system configuration information, an operation 230 is performed by the platform firmware 
for configuring the platform-side operating system dependent upon at least a portion of the 
accessed system configuration information. 

In this manner, the platform-side operating system is configured based on information 
that was discovered and analyzed by the service processor prior to the system boot (i.e., being 

25 re-booted). This allows the service to track the history of components, and analyze the 
remains of a previous instantiation of the platform-side operating system to determine if a 
component should be included in the next boot. The service processor can initiate human 
interaction (e.g., interact with a system administrator) prior to boot to warn about failing 
components or, if preferences so dictate, the service processor can automatically remove 

30 suspect components. 
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In an embodiment where the platform firmware includes boot-time firmware, 
enabling access of the system configuration information by the boot-time firmware includes 
transmitting at least a portion of system configuration information (e.g., system component 
status information) for reception by the boot-time firmware via a network-like connection. 
Examples of such a network-like connection include a connection enabled by hardware 
control logic that provides an interface utilizing dual access memory with TCP, UDP, and IP 
protocols and a connection enabled using Ethernet interface approaches. In an embodiment 
where the platform firmware includes run-time firmware, enabling access of the system 
configuration information by the run-time firmware includes maintaining at least a portion of 
the system configuration information (e.g., system component status information) in a 
persistent data structure (e.g., a persistent memory table) that is accessible by the run-time 
firmware. Accordingly, access to at least a portion of the status information by the run-time 
firmware is enabled. It is contemplated herein that a service processor of a data processing 
system as disclosed herein is capable of transmitting system configuration information and/or 
maintaining system configuration information in a persistent data structure. 

An embodiment of the operation 210 for facilitating management of primary system 
components is depicted in FIG. 3. As depicted, an operation 250 is performed for tracking 
operation of the primary system components. Probing device drivers associated with each 
primary system component and receiving status information of each primary system 
20 components from the corresponding device driver is an example of tracking operation of the 
primary system components. Accordingly, tracking operation of the primary system 
components provides for tracking of status information associated with the primary system 
components. 

During intended operation of the platform-side operating system, operation of each 
25 primary system components is tracked until either the system is shut down or until a 
platform-side operating system failure is detected. In response to performing an operation 
252 for determining that a platform- side operating system failure has been exhibited (e.g., 
receiving an operating system failure notification), an operation 254 is performed for 
assessing status of the primary system components. For example, a service processor of the 
30 data processing system queries for information available on the system to determine if one of 
the primary system components was responsible for the failure. If it is determined that a 
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particular primary system components is at fault, an operation 256 is performed for 
implementing a specified corrective action. An example of such specified corrective actions 
is to employ administrator-specified preferences to determine if the particular primary system 
component should be removed or replaced prior to the next boot of the platform-side 
5 operating system. If none of the primary system components are at fault, the method 
continues at an operation 220 for enabling access of system configuration information 
(including any newly discovered status information) by the platform firmware. 

An embodiment of the operation 215 for facilitating management of redundant system 
components is depicted in FIG. 4. As depicted, an operation 280 is performed for tracking 

10 operation of the redundant system components that are idle during a particular platform-side 
operating system instantiation (e.g., not presently providing redundancy functionality). 
Probing device drivers associated with each redundant system component and receiving 
status information of each redundant system components from the corresponding device 
driver is an example of tracking operation of the redundant system components. 

15 Accordingly, tracking operation of the redundant system components provides for tracking of 
status information associated with the redundant system components. 

During intended operation of the platform-side operating system, status information 
of redundant system components tracked (e.g., periodically via probing). In response to 
performing an operation 252 for determining that a particular redundant system component is 
20 unavailable to provide redundancy functionality (e.g., the redundant system component has 
been removed or has errors), an operation 284 is performed for implementing a specified 
corrective action. An example of such specified corrective actions includes warning a system 
administrator to repair or replace that component for maintaining fail-over capability. 

Referring now to computer readable medium in accordance with embodiments of the 
25 disclosures made herein, methods as disclosed herein are tangibly embodied by computer 
readable medium having instructions thereon for carrying out such methods. In one specific 
example, instructions are provided for carrying out the various operations of the method 100. 
The instructions may be accessible by the service processor and platform-side operating 
system from a memory apparatus of the data processing system (e.g. RAM, ROM, virtual 
30 memory, hard drive memory, etc), from an apparatus readable by a drive unit of the data 
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processing system (e.g., a diskette, a compact disk, a tape cartridge, etc) or both. Examples 
of computer readable medium include a compact disk or a hard drive, which has imaged 
thereon a computer program for carrying out discovery and configuration functionality as 
disclosed herein. 



5 In the preceding detailed description, reference has been made to the accompanying 

drawings that form a part hereof, and in which are shown by way of illustration specific 
embodiments in which the invention may be practiced. These embodiments, and certain 
variants thereof, have been described in sufficient detail to enable those skilled in the art to 
practice the invention. It is to be understood that other suitable embodiments may be utilized 

10 and that logical, mechanical and electrical changes may be made without departing from the 
spirit or scope of the invention. For example, functional blocks shown in the figures could be 
further combined or divided in any manner without departing from the spirit or scope of the 
invention. To avoid unnecessary detail, the description omits certain information known to 
those skilled in the art. The preceding detailed description is, therefore, not intended to be 

15 limited to the specific forms set forth herein, but on the contrary, it is intended to cover such 
alternatives, modifications, and equivalents, as can be reasonably included within the spirit 
and scope of the appended claims. 
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